ABSTRACT 

A language model generation and accumulation apparatus 
(10) that generates and accumulates language models for speech 
recognition is comprised of: a higher-level N-gram generation and 
accumulation unit (11) that generates and accumulates a 
higher-level N-gram language model obtained by modeling each of a 
plurality of texts as a string of words including a word string class 
having a specific linguistic property; and a lower class dependent 
word N-gram generation and accumulation unit (12) that generates 
and accumulates a lower-level N-gram language model obtained by 
modeling a sequence of words included in each word string class. 
(57) Abstract: A language model creation/accumulation device (10) for creating and accumulating a language model for speech 
recognition includes: an upper node class N-gram creation/accumulation section (11) for creating and accumulating an upper node 
N-gram language model obtained by modeling a plurality of texts as a word string containing a word string cluster having a 
particular language characteristic; and a lower node class dependent word N-gram creation/accumulation section (12) for creating and 
accumulating a lower node N-gram language model obtained by modeling a word string in a word string cluster. 
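
The two-level arrangement described above can be illustrated with a minimal sketch. It is not the apparatus's actual implementation: it assumes bigrams (N = 2), unsmoothed maximum-likelihood estimates, and a hypothetical input format in which each word string class is already marked as a (class_name, words) tuple. The names train, prob, bigrams, and the "<title>" class are all illustrative assumptions.

```python
from collections import Counter, defaultdict

BOS, EOS = "<s>", "</s>"  # sentence boundary markers (assumed convention)


def bigrams(tokens):
    """Yield adjacent pairs from a token sequence padded with BOS/EOS."""
    padded = [BOS] + list(tokens) + [EOS]
    return zip(padded, padded[1:])


def train(texts):
    """Count bigrams at both levels.

    Each sentence is a list whose items are either a plain word (str) or a
    (class_name, [words]) tuple marking a word string class (hypothetical format).
    """
    higher = Counter()             # higher-level model: words plus class tokens
    lower = defaultdict(Counter)   # lower-level models: one per word string class
    for sentence in texts:
        units = []
        for item in sentence:
            if isinstance(item, tuple):
                cls, words = item
                units.append(cls)                  # the class acts as one unit
                lower[cls].update(bigrams(words))  # words inside the class
            else:
                units.append(item)
        higher.update(bigrams(units))
    return higher, lower


def prob(counts, prev, cur):
    """Unsmoothed maximum-likelihood estimate of P(cur | prev)."""
    total = sum(c for (p, _), c in counts.items() if p == prev)
    return counts[(prev, cur)] / total if total else 0.0


texts = [
    ["record", ("<title>", ["shining", "star"]), "tonight"],
    ["record", ("<title>", ["shining", "moon"]), "please"],
]
higher, lower = train(texts)
print(prob(higher, "record", "<title>"))            # class follows "record"
print(prob(lower["<title>"], "shining", "star"))    # word sequence inside the class
```

In this sketch the higher-level counts treat each word string class as a single token, while each class keeps its own lower-level counts over the words it contains, so the probability of a full sentence would be the product of the two levels.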